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I.     INTRODUCTION    £^1)    REVIEW    OF    RELEVANT    LITERATURE 

A.       INTRODUCTION 

This  thesis  is  an  investigation  of  nethods  for 
predicting  the  rate  of  reenlistment  in  the  aned  forces, 
specifically  the  Navy.  Since  the  advent  of  the  all  volunteer 
force  in  1973,  one  of  the  major  concerns  of  the  military  has 
been  the  retention  of  qualified  personnel  beyond  their  first 
enlistment.  Referred  to  as  first  term  reenlistn ent s,  this 
decision  has  been  the  object  of  extensive  study  and  modeling 
by  each  of  the  services.  The  vast  majority  of  this  work  has 
centered  around  the  formulation  of  causal  molels  witn  a 
heavy      emphasis    on      economic   factors.  During   periods      when 

reenlistments  have  been  below  the  required  levels,  these 
models  have  been  quite  good  at  capturing  the  effect  of  the 
economic  factors  used  to  suggest  the  level  Df  monetary 
compensation.  Over  the  past  four  years,  this  situation  has 
changed  to  the  point  where  the  services  are  faced  with  such 
large  numbers  of  personnel  desiring  to  remain  in  the 
services,  that  high  reenlistment  is  actually  Lowering  the 
numbers  of  personnel  that  can  be  enlisted  for  some  ratings 
at  the  recruiting  stations.  As  always,  there  is  still  a  need 
for  more  personnel  in  the  nuclear  related  ratings  but  seme 
of  the  less  technical  fields  are  approaching  an  end  point 
where  only  a  limited  number  of  billets  may  be  available  to 
new    accessions. 

The  object  of  this  thesis  is  to  attempt  to  construct  a 
short  term  model  that  could  aid  in  predicting  first  term 
reenlistments  for  five  selected  Navy  ratings.  Initial  models 
will  be  developed  utilizing  the  3ox-Jenkins  method  of  time 
secies      analysis.         These      initial    models      will      be    used      to 
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predict  the  level  of  reenlistment  for  the  selected  ratings. 
Next,  a  leading  indicator  model  will  be  develop9d  utilizing 
the  national  unemployment  rate  for  20-24  year  oils.  Then,  a 
refined  forecast  will  be  developed  combining  both  time 
series    and    causal   models. 

The  potential  advantage  of  time  series  analysis  lies  in 
its  simplicity  and  lack  of  reliance  on  external  factors.  To 
many,  this  is  viewed  as  a  shortcoming  since  it  cannot 
explain  causal  relationships  such  as  the  effect  of  adver- 
tising dcllars  on  sales  for  a  company.  However,  the  effec- 
tiveness of  advertising  dollars  is  not  easy  to  determine  and 
in  many  cases,  may  lead  to  false  conclusions  rfhen  used  in 
classical  regression  analysis.  Other  errors  such  as  the 
autocorrelation  of  an  independent  variable  or  tte  raulticcl- 
linearity  of  several  variables  are  also  avoided  in  the  use 
of  time  series  analysis.  Since  time  series  anaLysis  relies 
on  its  ability  to  reproduce  itself  over  time,  this  allows  it 
to  be  free  of  the  errors  of  regression  and  still  retain  the 
ability  to  adequately  forecast  events:  or  ia  this  case 
levels  of  reenlistment.  A  more  thorough  explanation  of  the 
Box-Jenkins   method    is  presented    in   Appendix    B. 

If  a  time  series  model  is  accurate  at  predicting  changes 
in  trends  as  well  as  levels  of  reenlistment,  then  it  may 
also  be  useful  as  a  tool  to  adjust  the  levels  of  reenlist- 
ment bonuses  to  the  most  cost  effective  level  aecesarry  to 
retain  the  desired  force  levels.  If  a  model  can  accurately 
foresee    a   significant  increase      in    reenlis tments,  indepen- 

dent of  the  reenlistment  bonus,  the  bonus  le/el  for  that 
rating  can  be  scaled  down  appropriately  to  retain  personnel 
without  the  payment  of  an  economic  rent.  (Rent  ,  in  this 
case,  is  the  payment  of  a  bonus  for  a  decision  independent 
of  the  bonus  award  level.)  To  summarize,  time  series 
analysis  may  not  prove  to  be  a  panacea  in  forecasting  reen- 
listment rates  but  it  most  definitely  is  a  tool  that 
deserves    wider    consideration   and    application. 
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B.        REVIEW    OF    RELEVANT    LITERATURE    IN    THE    FIELD 

1  .      Overview 

To  the  best  of  the  author's  knowledge,  the  applica- 
tion of  Box-Jenkins  Time  Series  Analysis  to  : eenlistment 
models      has      not      been      previously      attempted.  A      search, 

conducted  by  the  Defense  Technical  Information  Center 
(DTIC) ,  utilizing  both  title  and  subject,  did  not  reveal  any 
references  to  its  use  of  Box-Jenkins  modeling  for  reenlist- 
ment.  Bepko  [Bef.  1],  in  his  thesis  modeling  :areer  petty 
officers  reviews  the  relevant  literature  concerning  first 
term  reenlistment  and  career  reenlistment  from  1974  to  the 
publication  of  his  thesis  in  1981.  This  review  will  address 
relevant    publications  since    that    time. 

2«       The    Annualized   Cost    of    Leaving    (ACOL)     Molel 

This  model  was  developed  for  the  Navy  by  John  Warner 
at  the  Center  for  Naval  Analysis  and  is  currently  the  most 
widely  used  model  in  the  Navy  with  relation  to  manpower  and 
personnel   policy    decision  making. 

The  model  itself  is  a  sophisticated  multiple  linear 
regression  that  attempts  to  capture  several  important  under- 
lying forces  in  the  reenlistment  decision.  The  general  model 
is    of    the   form: 


M    /(1+r)       +    IJ 


♦    R^     /(1  +  r) 


Vt 


-(Wt+    R    i    (egn    1.1) 


where; 

1.  C (^n)  =  Net  present  value  of  pecuniary  and  non- 
pecuniary  returns  of  staying  in  the  military  until 
time    'n'    as    compared    to    leaving    at    time    't'. 
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2.  h  (j)  =  Monetary  returns  to  military  service  from 
period    't'    through    '  n' 

3.  W  (n)  =  Lump  sum  payment  of  the  present  /alue  of  the 
expectsd  post  service  civilian  wages  realized  by 
those   staying    in    the    military    until    time    'n'. 

4.  R  (n)  =  Lump  sum  payment  of  the  present  /alue  of  the 
expected  retirement  benefits  realized  by  those 
staying    in   the   military    until    '  n' 

5.  W  (t)  =  Present  value  in  year  't'  of  the  expected 
civilian  wages  realized  by  those  leaving  the  military 
in   year    ' t' . 

6.  P.  (t)  =  Present  value  in  year  't'  of  the  expected 
civilian  retirement  payments  for  those  Leaving  the 
military    in    year    't' 

7.  r   =    personal    discount   rate 

As  can  be  seen,  this  model  relies  on  de/elopment  of 
several  sub-models  relating  to  civilian  wage  structure, 
future  policy  decisions  concerning  lump  sum  boius  payments 
and  an  individual1 s  personal  discount  rate  over  time.  While 
the  results  obtained  with  this  model  have  been  superb,  the 
model    itself   is      complex   and   somewhat    difficult.  Since    the 

introduction  of  the  general  model  described  abo/e,  it  has 
been  further  refined  to  include  a  'taste'  factDr  for  mili- 
tary service  and  a  random  disturbance  term  whici  is  used  to 
capture  the  effects  of  sea  shore  rotation,  poor  I uty  station 
and  family  separation.  These  factors  improve  tie  model  but 
at    a    cost   of  aver    increasing   complexity. 

3.      Darling^s   Model    of    Marine    Corp   Enlistments 

Darling  [  Ref»  2],  utilized  a  combination  of 
Box-Jenkins  time  series  analysis  and  multiple  linear 
regression  to  predict  the  supply  of  upper  mental  category 
recruits  to  the  Marine  Corp.  The  procedure  entaiLed  develop- 
ment   of    three   separate   models    using    two    distinct    technigues, 
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the  first   was  a  standard   multiple  regression  of   the  logit 
form  : 

S(M)  =      a-M0  /[t  Mo    +(a-  t  M0)e  (eqn    1.2) 


where ; 

1.  S   =    Tha    supply  of    military    recruits 

2.  M    =    The    military    wage 

3.  a    =    Tha    stochastic   error    term 

In    this    phase   of    development,         the    following    variables    were 
introduced: 

1.  Civilian    military   pay   ratio 

2.  National    unemployment   for    16-19    year    old    Dales 

3.  Monthly    leads    from    print    media 

4.  Number   of    Marine   recruiters 

5.  Dummy   variable  to   account    for      anomalies    in    a    partic- 
ular  time    period. 

Further    analysis    Led    to    inclusion    of    items    one,    two    and    five 
from    the   abova    list    in   the    final    regression    modal. 

Next,  a  totally  independent  model  based  on 
Box-Jenkins  methodology  was  developed  for  Marine  accessions. 
This  was  a  univariate  model  whose  purpose  was  to  capture  the 
effect  of  seasonality  that  was  missed  by  the  regression 
model.    The    Box-Jenkins   method    yielded    a    model    of    the    form: 


yt  m4>,h-i-<t>,\*  *V*+0o+et-e/*e,_ 


(egn  1.3) 


12. 


where ; 

1.  Y  =  The  number  of  high  school  graduates  in  mental 
categories  I  and  II  that  enlist  in  the  Marine  Corp  in 
month  t. 
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=    The    autor egressi ve    coefficient    that   describes   the 
model 
3.       9  =    The    moving   average    coefficient    that   describes    the 
model 
"4-       e   =    The    error    term   of    the    model. 

These  two  models  were  initially  used  to  generate 
independent  forecasts  of  the  actual  enlistment  sapply.  5oth 
models  were  adequate  in  capturing  trends  in  the  actual 
enlistment  rate  but  were  somewhat  deficient  in  the  actual 
numbers  generated.  At  this  point,  Darling  combinad  the  tech- 
niques by  utilizing  the  Box-Jenkins  method  to  tate  advantage 
of  the  high  degree  of  serial  correlation  remaining  in  his 
regression  models  residuals  by  modeling  the  residuals  as  a 
separate  time  series.  The  result  of  this  method  was  to 
predict  'error'  terms  to  be  applied  to  the  resalts  of  the 
regression  equation.  This  combined  model  was  simmarized  by 
.eqn    1.4    shown    below; 


LSVc    =    LSVmr    +    Zt«  (eqn    1.4) 


where; 

1.  LSVc   =   The   predicted    value    of    the   combinel    model 

2.  LSVmr     =    The      predicted      value      using    the      regression 
model  only 

3.  Zt'    =  The      Box-Jenkins    model    of   the      resiiuals    of   the 
regression   model 

The  resulting  combined  forecasts  were  sa bst antially 
better  than  either  of  the  techniques  could  acaieve  sepa- 
rately and  was  extremely  accurate  in  capturing  the  general 
trend   of   the  enlistment    supply. 
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14  .      Be  pko1  s    Regression    Model 

Bepko  [Ref.  1],  concentrated  his  efforts  in  ieveloping  a 
multiple  regression  model  for  forecasting  career  retention 
beyond  the  first  enlistment.  This  model  was  unique  in  that 
it  evaluated  the  Navy  Rating  structure  by  occupational 
groupings  and  utilized  an  age  specific  index  for  introducing 
unemployment  into  the  model.  This  was  the  first  tfork  to  look 
at   ratings    and    unemployment    together    in    tnis    manner. 

Bepko' s  [Ref.  1  ]#  overall  findings  were  that  evalu- 
ating the  reenlistment  decision  by  groupings  i mong  caree- 
rists yielded  more  relevant  models  for  the  application  of 
bonus  payments  and  that  unemployment  among  th  =  25-39  age 
group  was  a  very  significant  factor  in  the  u  eenlistment 
decision   of   careerists. 

5.      The   Thomas-Liao    Model    of    Careerists 

The  work  of  Thomas  and  Liao  [Ref.  3],  continued 
along  the  same  track  as  Bepko  [Ref.  1 ]#  in  the  examination 
of  the  reenlistment  rate  among  careerists,  or  tiose  consid- 
ering their  second  or  subsequent  reenlistment.  The  differ- 
ence in  this  model  is  the  unique  grouping  consideration 
given  to  the  eating  structure,  where  the  ratings  are  aggre- 
gated by  the  patterns  of  their  past  reenlistment  percent- 
ages.   The   effect    of    this   appears    to    better 

capture  the  effect  of  the  significant  /ariables  on 
the  reenlistment  decision.  The  variables  utilized  in  this 
study  were  national  unemployment,  the  civilian  nilitary  pay 
ratio  and  tenure  as  expressed  by  years  of  sac  vice.  The 
results  of  the  predictions  generated  by  the  regression  equa- 
tion were  excellent  and  generally  less  than  13%  in  total 
err  or. 
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II.  APPLICATION  OF  EOX- JEN  KINS  ANALYSIS  TO  THE  J N IV AH I ATE 

MODELS 

This  chapter  presents  the  analysis  of  re-enlL s tment  data 
by  the  Box-  Jenkins  technique.  Box-Jenkins  analysis  involves 
three    steps; 

1.  Identification  -  This  involves  analysis  } £  the  time 
series  plots  cf  the  raw  data  to  try  and  discern  any 
obvious    trend    or    seasonality 

2.  Estimate  -  This  step  involves  analysis  oc  the  auto- 
correlations and  partial  autocorrelations  to  provide 
an   estimate    for   an   initial    model 

3.  Forecast  -  This  step  involves  running  tha  models  and 
generating  predictions  which  are  then  evaluated  for 
their  adequacy. 

Should    any      model   prove    inadequate,         the    estimation      of    the 
model    is   re-evaluated   to   find    a    more    suitable   molel. 

A.        INITIAL    ANALYSIS    OF    THE    DATA    SETS 

The  Box-Jenkins  method  was  applied  to  data  sets  of  the 
number   of   first    term    reenlistees    for    the    followiig    ratings: 

1.  Yeoman    (YN) 

2.  Storekeeper     (SK) 

3.  Operations   Specialist    (OS) 

4.  Electronics    Technician     (ET) 

5.  Boiler   Technician    (BT) 

These  ratings  were  selected  foe  analysis  because  they 
presented  a  representative  mix  in  mental  categor/  groupings, 
varying  degrees  of  general  and  specific  training  and  also 
provided  sufficient  numbers  of  reenlistments  to  perform 
valid   analysis. 
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The  data  consisted  of  monthly  summaries  of  first  term 
reenlistment  percentages  for  the  subject  ratinjs  covering 
the  period  from  October,  1980  through  September,  1983,  these 
data  were  provided  by  Mr.  James  McEwan,  the  statistician  for 
the  Re-enlistment  Programs  Development  Office  (OP-136),  a 
summary      of    the      data  is      shown    in      table    I.  Reenlistment 

percentages  for  these  ratings  ranged  from  a  low  of  41.2  per 
month  (BT)  ,  to  a  high  of  9  1.2  per  month  (ET).1  It  should  be 
noted  here  that  the  time  series  plots  of  tae  raw  data 
neither  suggest  any  clear  cut  trends  or  seasonality,  nor  are 
the    series    similar   across   the   ratings. 

The  time  series  plots  on  the  following  pag3s  represent 
the  percentages  of  reenlis tments  in  each  of  the  selected 
ratings,  the  vertical  axis  is  the  reenlistment  percentage 
and    the    horizontal      axis   is    the    time    line.  Th2    origin    for 

the  time  line  is  October,  1980  and  the  eni  point  is 
September,    1933. 

TABLE    I 
SOMHARY    STATISTICS    OF    REENLISTMENT    RATES 

RATE  DATES  MEAN  ST.DEV. 

YN  10/80-8/83  55.5  11.9 

SK  10/80-9/83  49.5  13.2 

OS  10/80-9/83  43.6  18.0 

ET  10/80-9/83  91.4  0  6.5 

BT  10/80-9/83  41.2  14.7 


1These    numbers    are   to   be   viewed      in    a    relati/e    sense    for 

their      ability    to      provide      sufficient  population    size      for 

their      respective      models      and    not      as  any      measurement      of 

"health"    in   a    particular    rating.       This  is    to   say    that    these 

percentages    do    not    necessarily    measure  need    in    a    particular 
rating,       but  rather    the      percentage    of   eligibles    reenlisting 

In      addition,         time      series      plots    of  each      data      set      are 
presented    in   Fig.    2.1    through    Fig.    2.5 
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B.        EXAMINATION  OF  AUTOCORRELATIONS  ANO  PARTIAL 

AUTOCORRELATIONS 

Autocorrelations  describes  the  associatiDn  between 
values  of  the  same  variable  but  at  different  time  periods. 
Autocorrelation  coefficients  provide  important  information 
about    the    structure      of    a    time   series.  These   coefficients 

can  be  used  to  identify  trends  and  possible  seasonality 
within    the    data.       [Ref.    4] 

Partial  autocorrelation  is  used  to  identify  the  extent 
of  the  relationship  between  current  values  of  a  variable 
with  earlier  values  of  that  same  variable,  while  holding  the 
effects    of    all    other    time    lags   constant.       [Ref.    4  ] 

1  .       Yeoman 

Examination  cf  the  autocorrelations  foe  the  Zeomar. 
rating,  Fig.  2.6,  suggests  that  the  data  is  stitionary  and 
should  not  require  any  transformation  prior  to  model  anal- 
ysis. The  residuals  are  within  two  standard  errors  of  the 
mean      zero    and       appear      to      be   randomly      distribited.  The 

partial  autocorrelations,  Fig.  2.7,  decay  to  zero  rapidly 
and  appear  random  after  this  point  suggesting  tiat  an  auto- 
regressive    model   may   be   appropriate. 

2  •       St orekee  per 

Examination  cf  the  autocorrelations  fo:  this  data 
set  indicated  the  residuals  met  the  randomness  criteria  of 
two  standard  errors.  The  shape  of  the  residuals  appeared  to 
be  a  decaying  sine  curve  Fig  2.8  The  partial  autocorrela- 
tions showed  no  shape  but  dropped  to  zero  graduaLly  and  were 
randomly  distributed,  Fig.  2.9,  not  suggesting  aay  clear  cut 
model. 
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Figure    2.6  AUTOCORRELATIONS    FOR    YN    RATING. 
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Figure    2.7  PARTIAL    AUTOCORRELATIONS    FOR    YN    RATING. 

3-      Operations    S pecialist 

Analysis    of    the   autocorrelations,         Fig.       2.10,       and 
partial    autocorrelations,         Fig.      2.11,       for      ths    Operations 
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Figure    2-8  ADTOCO EREIATIONS    FOR    SK    RATE NG. 
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PARTIAL 

AUTOCORRELATIONS    FOR 

SK 

RATING. 

Specialist  data  set  again  indicated  that  the  residuals  were 
randomly  distributed  and  had  characteristics  which  strongly 
suggested  the  use  of  some  type  of  a utoregressir e  operation 
in    model   selection. 
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Figure    2.10  AUTOC3 REELATIONS    FOR    OS    RATE NG. 
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Figure    2.11  PARTIAL    AUTOCORRELATIONS    FOR    OS    RATING. 

4-      Electronics    Technicians 

The    data    set    for      the    Electronics    Technicians    exhib- 
ited   a   strong      trend    when    the    residuals      were    evaluated,      in 
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addition  there  was  some  non-s tationarity  suggested,  which 
could  require  differencing2  to  remove.  Fig.  2.12  illustrates 
the  shape  of  the  ACF  function.  Evaluation  of  tie  resultant 
autocorrelation  showed  that  this  process  may  not  be  neces- 
sary as  the  first  lag  exceeds  -0.5  in  magnitude  which  is  a 
classic    indication    of   an    overdif f erenced    data    set . 

The  shape  of  the  autocorrelation  and  the  partial 
autocorrelation,  Fig.  2.13,  suggest  that  an  aut oregressive 
or   possibly  a   mixed    model    may    be    appropriate    for    evaluation. 
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Figure    2.12  AUTOCORRELATIONS    FOR    ET    RATE NG. 


2The  method  of  differencing  converts  non-stationary  time 
series  into  a  stationary  one.  It  consists  of  subtracting 
successive  values  from  one  another  and  using  ti  ere  differ- 
ence   as    a   new    time    series   [  Ref .    4]. 
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Figure  2.13    PARTIAL  AUTOCORRELATIONS  FOR  ET  RATING. 

5 •   Boiler  Technicians 

The  Boiler  technician  rating  data  set  exhibited  no 
strong  trend  in  the  autocorrelations,  Fig.  2.14.  The  shape 
of  the  autocorrelation  and  partial  autocorrelation,  Fig. 
2.15,  also  strongly  suggested  that  an  autoregress ive  type  of 
model  should  be  considered  for  evaluation.  This  was  indi- 
cated by  the  decaying  sine  wave  pattern  in  the  ACF  and  the 
abrupt  cutoff  of  the  value  of  the  PACF. 

C.   MODEL  DEVELOPMENT 

In  developing  the  models,  the  autocorrelations  and 
partial  autocorrelations  were  evaluated  against  representa- 
tive Box-Jenkins  models  of  the  au toregressiv3  (AR)  and 
moving  average  (MA)  type.  A  best  fit  model  was  ti  en  selected 
for  evaluation  utilizing  the  Minitab  General  Purpose 
Statistical  Computing  System.  The  results  of  these  models 
were  then  evaluated  to  ensure  the  residuals  wers  random  and 
less  than  two  standard  errors  of  the  mean  zero.  If  the  model 
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Figure  2-14    AOTOCOBBEIATIONS  FOB  BT  BATING. 
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Figure  2-15    PAB1IAL  AU10CCBBEIATIONS  FOB  BT  BATING. 

was  satisfactory  at  this  point,  the  sum  of  squared  errors 
(SSE)  was  evaluated  to  determine  if  it  was  the  lowest 
possirle  reduction  in  the  SSE.  In  the  event  that  mere  than 
one   model   passed   these   tests    the   t-ratios   were   then 
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evaluated.  Again,  if  all  these  tests  were  insignificantly 
different  among  a  fanily  of  models  the  principle  of  parsi- 
mony was  used  to  select  the  "test"  model.  In  every  model 
developed  during  this  phase  of  analysis,  all  three  of  the 
criteria  were  met  by  cnly  the  model  selected  which  made  the 
choice  of  the  correct  prediction  model  relatively  easy  to 
select. 

Once  these  models  were  selected,  forecasts  were  gener- 
ated for  the  period  October  1983  to  March  1984  and  compared 
to  the  actual  reenlistment  totals  for  the  period.  If  a  model 
was  evaluated  as  totally  inappropriate  at  this  point,  then 
further  investigation  and  modeling  was  pursued  to  attempt 
resolution  of  the  prctlem. 

lable  II  presents  a  summary  of  the  model  forecasts  for 
the  selected  ratings.  Model  summaries  and  statistics  are 
presented  in  appendix  C. 
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TABLE  II 
RESULTS  OF  UNIVARIATE  MODEL  FORECASTS 

95*  FORECAST  LIMITS 


PERIOD 

FORECAST 

LOWER 

UPPER 

ACTUAL 

ERRO 

YN    RATING 

OCT    8  3 

67.9 

46.3 

89.5 

60.7 

11.8 

NOV    83 

61.3 

37.6 

84.9 

55.6 

10.  1 

DEC    83 

58.3 

34.2 

82.4 

66.0 

11.6 

JAN    8  4 

57.0 

32.8 

8  1.1 

60.8 

17.  2 

FEB    84 

56.4 

32.2 

80.5 

59.1 

04.6 

MAR    84 

56.1 

31.9 

80.2 

51.8 

08.  2 

SK    RATING 

OCT    8  3 

60.4 

35.6 

85.2 

64.6 

06.  4 

NOV    8  3 

60.4 

35.1 

85.6 

69.8 

13.4 

DEC    83 

60.4 

34.7 

86.1 

70.8 

14.  6 

JAN    84 

60.4 

34.3 

86.5 

72.0 

16.  0 

FEB    84 

60.4 

33.9 

86.9 

66.7 

09.4 

MAR    84 

60-4 

33.5 

87.3 

75.0 

19.4 

OS    RATING 

OCT    83 

44.6 

13.4 

75.8 

41.5 

07.5 

NOV    83 

43.7 

08.8 

78.5 

41.2 

06.  0 

DEC    8  3 

43.2 

07.4 

78.9 

52.6 

17.8 

JAN    8  4 

43.0 

07.0 

78.9 

52.7 

18.  4 

FEB    8  4 

42.9 

06.9 

78.9 

48.9 

12.  3 

MAR    84 

42.8 

06.8 

78.8 

62.5 

31.  5 

ET    RATING 

OCT    83 

95.3 

84.7 

1  00 

95.9 

00.  6 

NOV    83 

93.6 

8  1.3 

1  00 

95.6 

02.0 

DEC    83 

92.6 

79.8 

1  00 

95.5 

00.3 

JAN    84 

92.1 

79.0 

1  00 

96.2 

04.  3 

FEB    84 

91.7 

78.6 

1  00 

94.2 

02.  6 

MAR    84 

91.5 

78.4 

1  00 

95.9 

04.5 

BT    RATING 

OCT    8  3 

52.5 

27.4 

77.6 

42.6 

23.2 

NOV    83 

47.2 

18.7 

7  5.8 

56.5 

16.  4 

DEC    8  3 

44.4 

14.9 

73.9 

48.0 

07.5 

JAN    8  4 

42.8 

13.0 

72.6 

52.0 

17.6 

FEB    84 

42.0 

12.1 

7  1.8 

55.6 

24.  5 

MAR    84 

41.5 

1  1.7 

7  1.4 

61.8 

32.  8 

33 


1 .       Yeoman,    M odel 

The    model  selected  for    the      Yeoman    ratinj     was    of    the 

ARIMA     (1,         0,       0)          type,  a      comparison    of      the    forecasted 

re-enlistment    percentages  with    the    actual    totals       showed   an 

average    error    of    .106    with  a   range      from   a    low   o'    -.172    to   a 

high    of    .118.         all    of   the  observations    were      within   the    95% 

confidence    limits      of   the  model   and      were   also      captured   in 

shape    by   the   aodel.       Fig.  2.16    graphically    illistrates    the 
forecasts   and   observations. 
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Figure  2.16    PLOT  OF  UNIVARIATE  MODEL  FOR  YN  RATING. 
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2 .  Storekeeper    Model 

Since  the  autocorrelations  and  partial  lutocorrela- 
tions  did  not  clearly  suggest  the  selection  o£  one  model 
type  as  being  superior  to  another,  an  iterative  method  was 
used  for  selection.  After  discarding  several  first  order  AR, 
MA  and  ARMA  models  it  was  decided  to  try  differencing  even 
though  the  data  did  not  strongly  suggest  that  this  was 
necessary.  The  resultant  ACF  and  PACF  satisfied  the  random- 
ness criteria  without  indicating  overdif f erencii g.  A  model 
of  the  AEIMA  (0,1,1)  order  was  then  found  to  neet  all  the 
necesarry  stringency  requirements  for  model  selection.  The 
model  generated  forecasts  with  an  average  error  of  .132  and 
a  range  from  -.064  to  -.194.  It  is  felt  that  iterative 
modeling  with  the  actual  data  for  the  forecast  period  would 
reduce  this  error  even  further.  Fig.  2.17  illistrates  the 
data    fit    with   the   model. 

3.  Operations    Specialist    Model 

Modeling  of  the  Operations  Specialist  rating 
resulted  in  the  selection  of  an  ARIMA  (1,0,0)  as  the  most 
appropriate  model.  The  data  for  the  rating  presented  the 
largest  fluctuation  in  range  over  the  entire  dati  set.  These 
fluctuations  have  no  doubt  influenced  the  ultimate 
predictive  power  of  the  selected  model.  As  a  result,  the 
model  yielded  acceptable  predictions  varying  fron  the  actual 
observations  by  an  average  of  .155  and  a  range  from  .06  to 
-.315.  As  with  the  previous  model,  successive  iterations 
with  the  new  observations  should  improve  the  predictive 
power  of  the  model.  Fig.  2.18  illustrates  the  models  fit 
with    the    actual    observations. 
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Figure    2.17  PLOT    OF    UNIVARIATE    MODEL    FOR    SK    RATING. 

4 •       Electronics    Technician   Model 

A  first  order  aut oregressive  model  of  the  ARIMA 
(1,0,0)  type  was  also  selected  for  the  electronics  techni- 
cian rating.  The  model  produced  spectacular  results  with  an 
average  error  of  -.025  and  a  range  of  errors  from  -.003  to 
-.045.  It  should  be  noted  that  this  data  set  is  also  the 
most  stable  over  time  with  an  average  of  more  thin  9  1%  first 
term  reenlistments.  While  this  makes  the  model'  s  job  some- 
what easier,  it  still  remains  as  a  powerful  model  for 
univariate  predictions.  Fig.  2.19  illustrates  ths  fitting  of 
the    mcdel. 
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Figure  2.18    PLOT  OF  ONIVARIATE  MODEL  FOR  OS  RATING. 

5 •   Boiler  Technician  Model 

In  tha  case  of  the  boiler  technicians,  an  ARIMA 
(1/0,0)  was  again  evaluated  as  the  most  i ppropriate, 
however;  this  model  was  the  poorest  predictor  of  any 
selected  for  evaluation.  The  average  error  was  .203  with  a 
range  from  -.328  to  .232.  The  model  also  failei  to  capture 
the  shape  of  the  actual  data  which  presented  an  jpward  trend 
while  the  model  indicated  a  downward  turn.  Tie  model  is 
however  acceptable  for  further  analysis  and  refinement  in 
the  transfer  function  model.  Fig.  2.20  illjstrates  the 
fitting  of  the  data  set  predictions. 
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Figure  2.19    PLOT  OF  UNIVARIATE  MODEL  FOR  ET  RATING. 

D.   SUMMARY 

In  this  chapter,  five  acceptable  univariate  modeLs  have  been 
developed  for  the  respective  data  sets.  These  molels  were  in 
most  cases  fairly  obvious  from  the  analysis  of  the  autocor- 
relations and  partial  autocorrelations,  however  the  process 
can  be  fairly  time  consuming  and  result  in  the  pursuit  of 
several  "blind  alleys"  on  the  way  to  a  workable  model.  In 
succeeding  chapters,  a  transfer  function  modsL  utilizing 
unemployment  statistics  for  the  20-24  age  gr^up  will  be 
developed. 
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Figure  2.20    PLOT  OF  ONIVABIATE  MODEL  FOR  BT  RATING. 
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III.  THE  LEADING  INDICATOR ,  REGRESSION  AND  COMBINED  MODELS 

A.  OVERVIEW 

A  leading  indicator  model  for  reenlistment  in  the 
selected  ratings  was  constructed  by  first  dsveloping  a 
univariate  time  series  model  for  unemployment  ii  the  20-24 
year  old  age  group  and  then  applying  this  model  to  the  data 
for  the  selected  ratings.  The  resultant  model  residuals  were 
then  cr osscorrelated  to  establish  the  location  of  any  time 
leads  or  lags  that  affect  reenlistment s.  These  indicators 
could  provide  an  early  warning  system  of  shifts  in  the  level 
of  reenlistment  and/or  the  direction  of  the  trend  in  reen- 
listment. Once  developed,  the  adeguacy  of  the  model  was 
tested  by  using  the  coefficient  of  determination  (R-sguared) 
from  the  indicated  lag/lead.  Forecasts  for  the  0:  t  8 3- Mar  84 
time  period  were  also  generated  in  this  procsss  and  the 
results      compared    to      the    univariate      models    forecasts.  A 

combined      model    using      time    series      analysis    and      regression 
were    alsc   formulated. 

B.  THE    DNEMPLOYMENT    MODEL 

The  data  for  unemployment  in  the  20-24  age  bracket  was 
collected  from  monthly  publications  by  the  Bureiu  of  Labor 
Statistics.  These  figures  cover  the  period  from  October  1980 
through  September  1983  and  are  presented  in  Appendix  A.  This 
particular  age  grouping  was  selected  as  being  the  most 
appropriate  for  personnel  completing  their  first  enlistment 
and    facing    the    reenlistment    decision. 
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1  •   1^1  hod  o  log  y_ 

The  unemployment  time  series  model  was  constructed 
utilizing  the  same  methodology  as  applied  in  tie  preceding 
chapter  to  reenlistment  time  series  for  the  selacted  rating 
models.  The  unemployment  data  was  initially  transformed  by 
computing  the  relative  percentage  change  from  oa e  period  to 
the  next.  This  was  done  by  subtracting  the  rate  in  the 
current  period  from  the  rate  in  the  preceding  period  and 
dividing  the  remainder  by  the  rate  of  the  preceding  period. 
In  doing  this,  it  was  felt  that  the  resulting  model  would 
better  capture  responses  to  changes  in  the  unemployment  rate 
rather  than  responses  to  the  overall  level.  It  was  further 
hypothesized  that  this  would  capture  any  perceptions  by  the 
service  member  that  the  job  market  was  improviag  or  wors- 
ening in  relation  to  the  demand  for  a  particular  skill 
[Ref.  5]. 

The  computed  change  in  unemployment  tim2  series  was 
then  evaluated  for  a  potential  model  by  screening  the  auto- 
correlation aad  partial  autocorrelation  functions.  The  data 
appeared  stationary  but  did  not  suggest  any  oovious  model 
for  selection.  As  a  result  a  trial  and  error  method  was  used 
for  model  selection.  The  model  iterations  were  evaluated  for 
suitability  utilizing  the  same  criteria  described  in  the 
previous  chapter,  that  is  evaluation  of  residuals  for 
randomness,  smallest  sum  of  sguared  errors  and  a  t-ratio  in 
excess  of  2.D.  Several  trials  of  autoregressl  ve  models, 
moving  average  models  and  mixed  autoregress i ve  moving 
average  models  did  not  yield  any  positive  results. 
Therefore,  it  was  decided  to  difference  the  data  one  time 
and  try  the  iterative  model  building  process  again. 

The  resulting  differenced  data  set  also  met  the 
stationarity  criteria  and  did  not  exhibit  any  characteris- 
tics of  an  over  differenced  data  set.   When  evaluated  at  this 
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point,  the  autocorrelation  and  partial  autDcorrelation 
suggested  that  a  moving  average  model  was  appropriate.  This 
model  of  the  ARIMA  (0,1,1)  type  met  all  of  tie  selection 
criteria  necessarry  and  was  therefore  adopted  foe  use.  Model 
results  and  specifications  are  presented  in  Fig.  3.1  through 
Fig.     3.4. 
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Figure  3.1 


TIME  SERIES  PLOT  FOE  UNEMPLOYMENT  DATA. 
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Figure  3.3    PARTIAL  AUTOCORRELATIONS  FOR  UNEMPL3YMENT  DATA. 
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TABLE    III 
SUMMARY    OF    UNEMPLOYMENT    MA     1     MODEL 


NUMBER 
1 


TYPE 
MA       1 


ESTIMATE 
0.9  850 


ST.    DEV. 
0.0689 


T-R\ TIO 
14  .29 


DIFFERENCING.     1     REGULAR 

RESIDUALS.        SS    =    896.9     (BACKFOREC ASTS    EXCLUDED) 

DF    =    34    MS    =    2  6.4 
NO.     OF    OBS.     ORIGINAL    SERIES    36    AFTER    DIFFERENCING    35 
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C.       APPLICATION    TO    THE    SELECTED    RATINGS    DATA    SETS 


1 .       Yeoman 

When  the  ARIMA  (0,1,1)  model  was  applied  to  both  the 
reenlistmen  t  data  for  yeoman  and  the  differenced  unemploy- 
ment data  set  and  the  cross-correlation  function  evaluated, 
there  appeared  to  be  a  relationship  at  the  twelve  month  lead 
point.  The  value  at  this  point  was  significant  wien  compared 
to  the  other  points  however,  it  was  not  significant  in  abso- 
lute terras  since  it  was  not  in  excess  of  two  standard  errors 
of  the  mean  zero.  This  observation  of  magnitude  holds  true 
in  all  of  the  models.  Fig.  3.5  shows  the  cross- correlation 
function   for   this   data   set    for    all    lead    months   only. 
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Figure    3.5         CROSS    CORRELATION    FOR    YN    RATING. 
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2.       Storekeepers 

The  cross-correlation  function  for  the  storekeeper 
model  indicated  a  possible  lead  indicator  relitionship  at 
the  five  month  and  eleven  month  points.  Both  of  these  points 
were  significant  in  their  relationship  to  the  Dther  values 
but  again  were  below  the  accepted  level  for  determining 
significance.  It  was  decided  to  investigate  the  significance 
of  these  points  in  the  regression  procedure.  Fig.  3.6  shows 
the  cross-correlation  function  for  positive  lei ds  of  this 
model . 
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Figure    3.6         CEOSS    COREELATION    FOE    SK    EATING. 


46 


3-      Operations    Specialists 

The  cross-correlation  function  again  iniicated  more 
than  one  point  for  possible  investigation  as  being  rela- 
tively significant.  These  points  occurred  at  ths  six,  nine 
and  twelve  month  points.  Fig.  3.7  again  shov s  the  lead 
values   for    this    model. 
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Figure    3.7  CROSS    CORRELATION    FOR    OS    RATING. 
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4.       Electronics    Technic  ians 

The  cross-correlation  function  for  this  model  also 
suggested  more  than  one  lead  point  for  investiga: ion,  these 
occurred  at  the  six  and  twelve  month  points  but  again  were 
significant  only  in  relative  terms  and  not  in  absolute 
magnitude.  Fig.  3.8  illustrates  the  lead  relationship  in  the 
cross-correlation  function. 
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Figure   3„8         CROSS   CORRELATION    FOR    ET    RATING. 
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5-       Boiler    Technicians 

Consistent  with  the  previous  models,  tie  model  for 
Boiler  Technicians  also  presented  more  than  one  prominent 
point  for  evaluation.  These  points  occurred  at  the  nine  and 
twelve  month  points.  Fig.  3.9  is  the  cross-correlation  func- 
tion   for    the   lead   values   of    the    model. 
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Figure    3.9         CROSS    CORRELATION    FOR    BT    RATING. 
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D.       LINEAR    REGRESSION 

In  order  to  verify  the  leading  indicator  models,  a 
linear  regression  will  be  constructed  using  20-2  4  year  old 
unemployment  as  the  independent  variable  and  reenlistment 
rates  for  the  selected  ratings  as  the  dependent  variable. 
For  this  process  to  be  valid,  certain  assun ptions  are 
required  prior  to  application  of  the  process.  The  first 
assumption  is  that  there  is  a  linear  relationship  between 
the    variables   as   described    by    egn.    3. 1 


Y      =   a   +    BX,     +    e  (egn    3.1) 


where  for  each  observation,  Y  is  a  random  variable.  The 
second  assumption  is  that  X  is  fixed  in  value  and  the  final 
assumption  is  that  e,  the  error  term,  has  an  expected  value 
of  zero  with  constant  variance  for  all  observations.  It  is 
further  assumed  that  the  e's  are  normally  distributed  and 
uncorrelated.      [ Ref .    6] 

E.       MODEL    VERIFICATION 

Linear  regression  models  were  run  for  all  of  tie  data  sets 
using  the  lagged  value  of  the  unemployment  data  as  the  inde- 
pendent variable  and  the  reenlistment  data  as  tie  dependent 
variable.  The  regressions  were  evaluated  using  tie  E-sguared 
value,  Durbin-Wat son  statistic  and  t-ratio  A  sun mary  of  the 
regression  models  R-squared  and  Durbin  Watson  statistic  is 
presented   in  Table    IV. 

The  regression  models  for  all  of  the  ratings  were  less 
than  robust  in  their  ability  to  verify  the  significance  of 
the  leading- indicator  models.  R-squared  values  ranged  from  a 
high    of    .34      for   the    Yeoman    model      to    a    dismal   -.  02      for    the 
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Boiler  Technician  model.  These  values  represent  the  best 
F.-sguared  that  were  obtained  at  all  lagged  values  of  the 
independent  variable  and  not  just  the  ones  that  were  indi- 
cated as  being  significant  in  the  leading  indicator  model. 
t-ratios  were  somewhat  stronger  ranging  f ro m  .1  for 
Storekeepers  to  11.3  for  Electronics  Technicians  with  three 
of  the  five  ratings  having  values  above  the  minimum  accept- 
ance level  of  2.0.  The  Dur bin-Watson  statistic  was  weak  for 
the  models  as  well  with  only  the  Storekeeper  no  del  clearly 
within  the  acceptable  range  of  1.5  to  2.5  foe  the  data. 
This  indicates  that  the  regression  failed  to  remove  all  of 
the  serial  correlation  present  in  the  data  sits  and  the 
residuals   are   positively   correlated    to    one    another. 

While  these  models  are  disappointing,  ti  ey  are  not 
discouraging.  They  seem  to  indicate  that,  when  taken  alone, 
unemployment  does  not  possess  the  strong  predictive  ability 
it  seems  to  have  in  other  econometric  models.  3epko 
[Ref.  1],  and  Darling  [Ref.  2],  in  their  regression  models 
attribute  nearly  50  percent  of  the  explained  sun  of  sguared 
error  to  unemployment,  this  may  indicate  that  this  relation- 
ship may  not  hold  in  modeling  the  behavior  of  first  term 
reenlistments  for  military  personnel.  It  shouL d  be  noted 
that  Bepko  [Ref.  1],  constructed  an  aggregated  model  of 
careerists  using  the  25-39  age  group  for  unemployment  and 
Darling  [Ref.  2],  utilized  national  teenage  unenployment  in 
modeling    Marine    Corp    enlistments. 

The  predictions  for  the  regression  model  are  compared  to 
the  observed  levels  of  reenlistment  for  the  period  October 
1983  to  March  1984,  these  forecasted  levels  are  shown  in 
table  VI  In  addition,  the  regression  forecast  for  the  entire 
data  set  from  October  1980  to  September  1983  wiLl  be  gener- 
ated and  the  residuals  for  that  data  set  will  be  indepen- 
dently modeled  as  a  new  time  series  with  a  forecast  of 
residuals   to  be    applied    to    the    forecasts    from    the    regression 
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TABLE    IV 
SUMMARY   OF    R-SQUARED    AND    DURBIN-ff ATSON    STATISTICS 


X=UNEMPLOYMENT     FOR    20-24    YEAR    OLD    AGE    GROUP 

SELECTED    RATINGS    E-SQUARED    VALUES 

LAGS      (X) 

(DURBIN-WATSON    STATISTIC    IN    PARENS) 

YN 

SK 

OS 

ET                  BT 

1 

.265 
(1.  54) 

.22 
(1.77) 

.027 
(.82) 

.  181 
(1.22)         | 

.03 
,.87) 

2 

.272 
(1  .56) 

-154 
(1.81) 

.078 
(.83) 

.141 
(1.2)           \ 

-.03 
.87) 

3 

.30 
(1.  55) 

.199 
(1.56) 

.025 
(.84) 

.  133 

(1.22)         | 

.031 

,.84) 

a 

.349 
(1  .43) 

.  163 
(1-72) 

.037 
(.81) 

.115 
(1.51)        { 

-.027 
,.83l 

5 

.213 
(1.3) 

.134 
(1.76) 

.031 
(.81) 

.041 
(1.39)         | 

.023 

;.84i 

6 

.259 
(1.31) 

.229 
(1.76) 

.006 
(.79) 

.  022 
(1-55) 

-.032 
;-79| 

7 

.272 
(1.25) 

.253 
(1.75) 

.  002 
(.8) 

.  003 
(1.6)           i 

-.035 

;.8D 

8 

.21 
(1.16) 

.224 
(1.6) 

.004 
(.79) 

-.009 
(1.  65) 

-.034 

;.74i 

9 

.148 
(1.16) 

.183 
(1.58) 

-.  023 
(.81) 

.  024 
(2.08) 

•.027 
[-73) 

10 

.171 
(1.  14) 

.193 
(1-69) 

-.  023 
(.79) 

.  024 
(2.08)        I 

-.027 
[-75) 

11 

.098 
(1.3) 

.115 
(1.97) 

-.  013 
(.74) 

-.011 
(2.08) 

-.043 
[-75) 

12 

.203 
(1.66) 

.130 
(2.24) 

-.  045 
(.71) 

.051 
(2.09) 

-.042 

;.8) 

models.  This  procedure  should  yield  forecasts  with  a  better 
fit  than  was  possible  with  only  the  univariate  or  regression 
models.  This  procedure  closely  parallels  the  work,  of  Darling 
[Ref.  2],  in  modeling  the  supply  of  recruits  for  the  Marine 
Corp.  The  combined  model  will  be  developed  in  the  next 
section. 
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F.       THE    COMBINED    MODEL 

The  regression  models  in  the  previous  section  yielded 
adequate  forecasts  of  the  reenlistment  rate,  thsy  were  not, 
however,  significantly  better  than  the  forecasts  for  the 
univariate  models  in  chapter  three.  Additionally,  the 
residuals  of  the  regression  model  exhibited  strong  positive 
serial  correlation  as  indicated  by  the  low  values  of  the 
Durbin-Watson  statistic  for  each  model,  Table  17  summarizes 
the  data  for  the  regression  models.  As  shown  by  Darling 
[Eef.  2],  this  enables  the  residuals  to  be  constructed  into 
an  independent  time  series.  Through  the  application  of  the 
Box-Jenkins  method  a  forecast  of  the  residuals  or  error 
terms  can  be  generated  and  applied  to  the  regression  models 
forecasts.  This  procedure  should  yield  a  forecast  with  a 
better    fit    to    the    actual   data. 


RATE 

YN 
SK 
OS 

ET 
BT 


LAG 

4 

7 

4 

1 

5 


TABLE    V 
REGRESSION    MODELS    FOR    SELECTED    RATINGS 


CONSTANT    BETA 
9.05         3.32 


1.  59 

73.2 
73.2 
52.6 


3.57 
-2.08 
1.31 
-.743 


ST.DEV 
9.97 
1  1.88 
16.82 
5.53 
14.  93 


R-SQRD 

.327 

.253 

.037 

.181 

-.023 


DUR3  IN- 

WATSON 

1.43 

1.75 

.81 

1.22 

.84 


The  Box-Jenkins  methodology  as  described  in  Chapter 
three  was  again  applied  to  the  sets  of  regression  model 
residuals    and   appropriate   models   were    selected.       Table    VI    is 
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a      summary      of       the      forecasts    for      all      three      nethods      and 
percentage    of   error    for    each    forecast. 

The  combined  model  resulted  in  improved  forecasts  in 
three  of  the  five  ratings  with  the  other  two  showing  either 
minor  improvement  or  a  slight  decline  (BT,  OS).  Et  should  be 
noted  at  this  point  that  these  two  data  sets  were  the  most 
volatile  in  terras  of  range  of  observations.  Tie  forecasts 
for  these  ratings  could  be  improved  by  eliminating  signifi- 
cant outliers  from  the  data  sets  and  recomputing  all  of  the 
models.  Due  to  the  already  small  size  of  the  dati  sets,  this 
was  not  considered.  This  problem  should  be  corrected  in 
future  works  in  this  area  as  more  data  points  become  avail- 
able   for   analysis. 
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IV-  SUMMARY  AND  CONCLUSIONS 


A.   SUMMARY 

Several  forecasting  techniques  have  been  s  xamined  in 
this  thesis  in  an  attempt  to  predict  the  pattecn  of  reen- 
listments  in  five  specific  ratings.  Two  distinct  methods 
were  used  to  build  three  models;  a  univariate  Box-Jenkins 
model,  a  linear  regression  model  and  a  combine!  regression 
and    Box-Jenkins    model. 

The  results  of  each  varied  in  predictive  ability,  with 
the  combined  model  being  clearly  superior  to  the  other  two, 
(as  measured  by  percent  error  of  the  actual  observations), 
but  with  the  results  by  rating  differed  sharply  within  each 
model.  For  electronics  technicians  all  three  models  were 
clearly  adequate,  this  is  not  surprising  since  this  rating 
had  the  smallest  range  and  the  least  variance  in  reenlist- 
ments  during  the  time  period  examined.  The  regrassion  equa- 
tion for  ET's  yielded  a  very  low  R-squared  value  of  the 
model.  Appropriate  additional  explanatory  variibles  may  be 
the  level  of  reenlistment  bonuses  or  the  availability  of 
advanced  technical  training.  Boiler  technicians  and  opera- 
tions specialists  showed  the  widest  range  in  reenlistment 
percentages,  and,  as  expected,  their  models  exhibited  the 
least  accuracy.  The  regression  equations  for  these  ratings 
were  counterintuitive  in  that  they  indicated  higher  reen- 
listment rates  at  successively  lower  levels  of  the  indepen- 
dent variable,  unemployment.  This  indicates  that  an 
additional  independent  variable  may  be  required  in  the 
equation  for  these  ratings.  For  BT's  this  may  be  a  dummy 
variable   accounting      for   the      unpleasantness    of       the    working 
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conditions  or  the  level  of  their  reenlistment  tonuses.  For 
the  OS  rating,  it  may  also  be  the  reenlistment  bonus  level 
or  a  factor  accounting  for  the  high  amount  of  sea  duty 
present  in  that  rating  when  compared  to  others.  The  ratings 
of  yeoman  and  storekeeper  presented  models  that  were 
marginal  when  using  Eox-Jenkins  or  regression  separately  but 
presented  quite  good  predictions  when  utilizing  the  combined 
model. 

A  somewhat  surprising  result  of  the  models  forecasts  was 
the  accuracy  of  the  regression  model  using  20-2  4  year  old 
unemployment  as  the  only  independent  variable.  This  is 
surprising  in  view  of  the  low  R-squared  values  of:  the  models 
and  the  high  degree  of  serial  correllation  remaining  in  the 
residuals  as  expressed  by  the  Durbin-Wa tson  statistic.  This 
was  actually  the  second  most  accurate  prediction  model 
outperforming  the  univariate  Box-Jenkins  model  Dy  a  slight 
margin. 

The  Box-Jenkins  models*  performance  was  restricted  by 
the  size  of  the  data  set  available.  Technically,  thirty  or 
raoce  observations  in  a  data  set  are  considered  sufficient 
but  100  or  more  observations  are  considered  desirable  in 
order  to  utilize  the  full  predictive  power  of  the  model. 
This  larger  number  of  observations  is  also  considered  desir- 
able in  terras  of  identifying  the  underlying  trends  and 
patterns  which  may  net  appear  in  a  smaller  set  of  data.  In 
terms  of  forecasting  reenlistments,  it  is  not  possible  at 
this  point  to  utilize  any  more  data  points  than  were  avail- 
able for  this  study  since  the  monthly  figures  ars  aggregated 
by  quarters  after  three  years  and  only  the  quarterly  data 
are    retained. 


57 


B.       CONCLUSIONS 

A  surprising  finding,  for  all  of  the  ratings  mods  led,  is  the 
continued  rise  in  reenli stments  in  view  o£  the  ever 
improving  economy  during  the  period.  This  could  possibly  be 
explained,  in  a  regression  model,  with  the  introduction  of 
the  civilian/military  pay  ratio  for  the  period  or  the  level 
of  reenlistmeat  bonuses  for  a  rating.  This  vould  still, 
however,  not  account  for  the  95  percent  reenlistment  rate 
for  electronics  technicians  who  are  generally  regarded  as 
having  the  most  desireable  and  marketable  skills  in  almost 
any  employment  market.  Another  explanatory  term  could  be 
introduced  for  "taste"  for  military  service  much  as  the  ACCL 
model  uses.  In  light  of  the  world  situation  and  recent 
events  in  Lebanon,  Granada  and  the  Persian  Gulc  this  could 
be  a  significant  explanatory  factor  for  the  continued  rise 
in    reenlistments. 

In  terms  of  policy  implications,  the  results  for  all  of 
the  models  utilized  indicate  that  high  levels  oc  first  term 
retention  are  likely  to  continue  in  all  of  these  ratings  for 
the  next  six  to  twelve  months.  At  this  point,  de;isions  will 
be  reguired  on  how  to  deal  with  these  increases  in  a  service 
that  is  rapidly  approaching  authorized  end  strength.  The 
longer  range  forecasts  still  seem  to  indicate  that  the 
currently  favorable  climate  will  eventually  givi  way  to  an 
ever  improving  economy.  Now  would  seem  to  be  the  most  oppor- 
tune time  to  take  advantage  of  the  situation  bf  increasing 
the  total  number  of  personnel  in  the  career  force  as  a  hedge 
against  the  future  change  in  the  demographics  of  the  cohort 
eligible  for  military  service.  This  will  indice  a  short 
term  increase  in  compensation  costs  by  increasing  the  inven- 
tory of  career  petty  officers.  This  will  evsntually  be 
offset  by  the  reduction  in  future  training  anl  recruiting 
costs    that    will    result   from    this    larger    career   fDrce. 
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C.       SUGGESTIONS     FOR    FUTURE    RESEARCH 

As      previously      stated,  the        fall      potential      of      the 

Box-Jenkins  method  has  not  been  fully  exploited  because  of 
restrictions  in  the  amount  of  data  available.  Firther,  this 
restriction  can  only  be  corrected  with  the  passage  of  time 
as  more  observations  become  available.  The  modeLs  presented 
in  this  paper  were  rating-specific,  which  ma/  only  have 
limited  application.  In  a  broad  sease,  however,  research 
should  continue  along  these  lines  with  aggregate  models  of 
rating  groups.  As  to  how  this  aggregation  should  be 
performed,  Thomas  and  Liao  [Ref-  3],  have  suggested  grouping 
ratings  by  observed  reenlistment  behavior  in  thsir  model  of 
second  and  subsequent  term  careerists.  This  grouping  should 
be  conducive  to  application  of  Box-Jenkins  techniques  which 
appears  to  be  more  effective  when  dealing  with  a  data  set  of 
narrow  range.  Another  possibility  for  grouping  rould  follow 
the  level  of  skill  required  as  indicated  by  a  eating  being 
termed    high-tech,    medium    tech    or    low    tech    [Ref.    1  ]. 

The  combined  regression,  Box-Jenkins  model  presented 
here  also  deserves  future  consideration  as  it  appears  to  be 
a  viable  "fine  tuning"  method  for  regression  models  and 
intuitively  more  appealing  than  introducing  more  and  more 
variables  into  the  analysis.  Use  of  a  combined  modeling 
technique  can  only  serve  to  strengthen  the  results  of 
regression    models   that  are    currently    very    popular . 

The  Box-Jenkins  method  is  not  meant  to  be  ai  all  encom- 
passing method  for  use  in  manpower  modeling.  [ t  certainly 
should,  however,  be  considered  as  a  tool  to  be  placed  in  the 
arsenal  of  the  manpower  planner  for  continued  us3  and  devel- 
opment. In  view  of  the  many  commercial  software  packages 
available  for  this  technique,  implementation  and  application 
to  manpower  issues  should  most  strongly  be  considered  for 
use    in   Navy   manpower    planning. 
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APPENDIX    A 
SUMMARY   OF    DATA    USED    IN    ANALYSIS 


TABLE    VII 
DATA    UTILIZED    FOE    ANALYSIS 


Rating 


Date 

Oct    8  0 

Nov 

Dec 

Jan 

Feb 

Mar 

Apr 

May 

Jun 

Jul 

Aug 

Sep 

Oct 

Nov 

Dec 

Jan 

Feb 

Mar 

Apr 

May 

Jun 

Jul 

Aug 

Sep 

Oct 

Nov 

Dec 

Jan 

Feb 

Mar 

Apr 

May 

Jun 

Jul 

Aug 

Sep 

Oct 

Nov 

Dec 

Jan 

Feb 

Mar 


80 
80 


8 


8 


82 
82 
82 
82 
82 
82 
82 
82 
82 
82 
82 
82 
83 
83 
83 
83 
83 
83 
83 
83 
83 
83 
83 
83 
84 
84 
84 


YN 


43.2 

50. 

66.1 

46. 

58.6 

34. 

43.2 

60. 

44.1 

40.7 

42.5 

36 

41 

70 

64 

52 

50 

54 

58 

70. 

63.9 

52.5 

50. 

39. 

61.7 

48.3 

58.6 

52.5 

68.1 

74.7 

68.1 

65.9 

66.3 

53.9 

67.4 

82.8 

60.7 

55.6 

66. 

68.8 

59.1 

51.8 


SK 


36 

53 

53 

30 

41 

43.7 

44.4 

67.2 

37.3 

26.9 


2 
4 
6 
3 
6 


28 

32 

34 

61 

62. 

59.2 

41.  1 

63.8 

46.4 

43.3 

49.  1 

37.  1 

42.6 

37.5 

55.8 

55.9 

62.5 

37. 

69.8 

55.8 

50. 

75. 

55.8 

53.  2 

70.2 

68.8 

64.6 

69.8 

70.8 

72. 

66.7 

75. 


OS 


5.1 

68.  1 

55. 

43. 

51. 

45.3 

63.2 

68.  1 

49.  1 

37.3 

30.6 

20.8 

49.5 

68.5 

53.  1 

50. 

63. 

74.  1 

55. 

62.5 

46. 

23.  1 

5.7 

2.3 

29.8 

37.8 

29.4 

39. 

44.4 

37.9 

54.5 

47. 

40.3 

22.8 

53.8 

46.5 

41.5 

41.2 

5  2.  6 

52.7 

48.9 

62.5 


77.7 
85.  1 

80.2 

73.6 

88.6 

82.2 

88.5 

87.  1 

80.8 

93.7 

94.3 

92.5 

93.8 

95.9 

93.8 

96.7 

97. 

97.1 


90, 

97. 

91. 

96 

92 


85.8 

91.8 

93. 

97.3 

87.9 
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APPENDIX    B 
THE    BOX-JENKINS    METHOD 

A.       OVERVIEW 

The  Box-Jenkins  procedure  can  be  used  to  fit  and  fore- 
cast time  series  data  by  means  of  a  general  class  of  statis- 
tical models.  An  observation  at  a  given  point  in  time  is 
modeled  as  a  function  of  its  past  values  and/or  current  and 
past  values  of  the  random  errors,  both  at  seasonal  and  non- 
seasonal  lags.  Box-Jenkins  methodology  will  modaL  a  variable 
with  observations  equally  spaced  in  time  and  no  missing 
values.  Sometimes  it  may  be  necessarry,  before  nodeling  the 
series,  to  transform  the  data  by  taking  the  log  function, 
square  root,  power  of  the  series  or  to  difference  the  series 
on    a    seasonal    or    non-seasonal    basis. 

The  modeling  of  time  series  data  is  usuaLly  done  in 
three  steps.  First,  identify  a  tentative  raoiel  for  the 
series.  Second,  estimate  the  parameters3  and  examine  the 
diagnostic  plots  and  statistics.  Third,  if  tie  model  is 
deemed  acceptable,  utilize  the  procedure  for  forecasting.  If 
the  model  is  inadequate,  return  to  step  one  and  evaluate  the 
time  series  for  more  appropriate  models  until  aa  acceptable 
one  is  found.  Fig.  B. 1  illustrates  the  steps  required  in 
Eox-Jenkins   analysis. 

The  advantage  gained  in  using  Box-Jenkins  analysis  is 
that  it  allows  the  data  to  speak  for  itself  siice  it  is  a 
univariate  procedure  and  therefore  does  not  alio*  for  expla- 
natory   variables.    The   underlying    more    restrictiva    assumption 


3For  most  software  packages  applied  to  Box-Jenkins 
modeling,  the  estimation  or  parameters  is  done  aa tomatically 
in  the  program  leaving  the  researcher  free  to  concentrate  on 
analysis    of    the    resulting    statistics. 
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throughout  all  Box-Jenkins  procedures  is  that  the  time 
series  will  eventually  repeat  itself  [Ref.  7],  oc  that  there 
is    some    pattern    underlying    the   data. 

B.  WHAT   IS    A    TIME    SERIES 

A  time  series  is  a  collection  of  observations  generated 
sequentially  over  time  at  specific  intervals  sur h  as  hours, 
days,  weeks,  months  or  years.  In  addition,  a  certain  depen- 
dence is  supposed  from  one  period  to  the  next.  It  is  this 
interdependence  that  is  of  value  when  trying  to  forecast 
future  activity  for  a  time  series.  Examples  of  time  series 
abound  in  fields  ranging  from  business  to  physics  and  are 
applied  to  analyze  monthly  sales  for  a  company,  quarterly 
yields  on  fiduciary  notes  or  the  chemical  yield  :>  f  a  certain 
substance  in  a  controlled  procedure.  A  time  series  can  also 
be  used  to  analyze  observations  that  are  either  discrete  or 
continuous;  by  way  of  example,  a  discrete  time  series  would 
be  the  closing  stock  price  of  a  company  and  i  continuous 
time  series  would  be  the  temperature  at  the  weather  center. 
In  summary,  a  discrete  time  series  is  one  where  observations 
exist  at  a  point  in  time  and  a  continuous  time  series  has 
potential  observations  at  all  points  in  time.  For  purposes 
of  this  discussion,  only  discrete  time  series  are  be 
addressed. 

C.  STATIONARITY 

In  pursuing  a  time  series  model,  the  first  assumption  to 
be  made  in  the  analysis  is  that  the  data  set  is  stationary. 
By  this  it  is  meant  that  the  observations  oscillate  around  a 
constant  mean  that  shows  no  growth  over  time.  Deviations 
about  this  mean  are  temporary  and  in  the  long  run  display 
equilibrium  about  the  mean  [Ref.  4]. 
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Figure     B. 1  THE    BOX-JENKINS    METHODOLOGY 
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A  further  measure  of  stationarity  can  be  gaiied  from  the 
autocorrelation  of  the  time  series,  that  is  the  correlation 
between  successive  observations  from  the  same  di  ta  set.  An 
observation  at  time  t,  denoted  by  Zt,  when  correlated  with 
an  observation  Zt+1  from  the  same  data  set  is  said  to 
produce  an  autocorrelation.  The  autocorrelation  is  measured 
by  pk  and  provides  important  information  about  tie  nature  of 
the  data  set.  A  value  close  to  +1  indicates  a  high  degree  of 
positive  correlation  between  observations,  while  a  value 
close  to  -1  indicates  a  high  negative  correlation . 

Most  time  series  are  not  stationary  and  reguire  some 
type  of  transformation  prior  to  analysis. 

D.   TIME  SERIES  PLOT 

The  first  step  in  determining  whether  a  tine  series  is 
stationary  or  not  is  to  construct  a  time  series  plot  of  the 
data  which  plots  observations  against  time  in  a a  attempt  to 
visually  determine  any  obvious  patterns  in  the  data.  Fig. 
B.2  illustrates  the  United  States  gross  national  product  for 
the  years  1947  through  1970  on  a  quarterly  basis.  This  plot 
shows  a  clear  upward  trend  in  the  data  which  ia dicates  the 
data  is  not  stationary,  further  within  each  year  there  are 
apparently  recurring  patterns  for  each  quarter  that  repeat 
on  annual  basis.  Finally,  as  time  passes,  the  variance  in 
GNP  tends  to  become  larger  and  more  volatile.  CLearly,  this 
data  set  must  be  transformed  prior  to  further  analysis  by 
the  Box-Jenkins  method. 


E.   DATA  TRANSFORMATION 

To  continue  with  the  example  of  GNP,  there  are  several 
possible  transformations  that  can  be  used  to  ini uce  statio- 
narity.  The  first  step  is  to   induce  a  constant  variance  in 


64 


19""0 


Figure    B.2         DNITED    STATES    GNP    1947    -    1970. 


the      data,         this      can      be      accomplished      either 
logarithmic    or    a    square    root    transformation.     Fig. 
trates    the    results       of   a   square   root    transform      d 
The    trend   is  still    clearly    present    bat    the    variaa 
smoothed   considerably. 

Once  variance  has  been  stabilized,  the  next. 
remove  the  trend.  There  are  several  sophistici 
sion  techniques  available  to  accomplish  this  ho 
method  of  differencing  will  be  the  only  one  add: 
For  a  more  detailed  discussion  of  these  alternat 
ques,  the  reader  is  directed  to  Makridakis  and 
[Ref.    4]. 

The  method  of  differencing  a  time  series  : 
subtracting  the  values  of  the  time  series  from  ea 
a  specified   order.    By  way   of    example,    consider   ta 
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Figure    B„3         SQUARE    BOOT    TRANSFORMATION    OF    U.S.     3  NP    1947    -    1970, 

data  set;  1,3,5,7,9,11.  When  plotted,  this  set  has  an 
obvious  linear  trend,  a  first  order  difference  involving 
subtraction  of  the  first  observation  from  the  second,  the 
second  from  the  third  and  so  on  results  in  the  transformed 
series  *, 2, 2,2, 2 ,2.  By  taking  the  first  difference,  the 
trend  disappears  and  yields  a  stationary  data  set.  The  **' 
indicates  that  whenever  a  data  set  is  differs  need,  one 
observation   is    lost      for   each    difference    operator .  Due   to 

random  fluctuations  in  real  data,  such  clear  cut  results  as 
those  illustrated  should  not  be  anticipated  however,  for  the 
majority  of  data  sets  diffsrencing  will  induce  a  sufficient 
amount    of  stationarity   to   proceed    with    further  analysis. 
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F.  AUTOCORRELATION  AMD  PARTIAL  AUTOCORRELATION 

A  useful  tool  in  model  estimation  is  the  autD correlation 
function  (ACF) ,  which  can  be  defined  as  the  association  or 
mutual  dependence  between  values  of  the  same  variable  but  at 
different  time  periods.  These  ACF  coefficients  provide 
valuable  information  about  a  data  set  and  any  pattern  that 
may  be  present.  If,  for  example,  a  high  positive  coefficient 
appeared  every  twelve  mouths,  a  seasonal  trsnd  may  be 
considered    to   exist. 

The  partial  autocorrelation  function,  (PACF)  ,  is  another 
and  complimentary  measure  to  be  applied  along  rfith  the  ACF 
to  aid  in  determining  model  type.  PACF's  are  analogous  to 
ACF's  in  that  they  indicate  the  relationship  oc  the  values 
of  a  time  series  to  various  time  lagged  values  of  the  same 
series.  They  differ  from  ACF's,  however,  in  ti  at  they  are 
computed  for  sach  time  lag  after  removing  the  effect  of  all 
other  time  lags.  In  essence,  they  show  the  relative  strength 
of    the    relationship    that   exists    for    varying    time    lags. 

When  the  ACF  and  the  PACF  are  analyzed  together,  they 
provide  a  very  powerful  tool  for  initial  modeL  selection. 
Fig.  B.4  through  Fig.  B.6  summarize  the  general  shapes  asso- 
ciated   with   the    different   types    of    models. 

G.  THE    AUTOREGRESSIVE    MODEL 

A  time  series  is  said  to  be  governed  by  an  autoregres- 
sive,  (AR) ,  if  the  current  value  of  the  time  series  can  be 
expressed  as  a  linear  function  of  the  previous  value  or 
values  plus  some  error  term  or  random  shock  valie  [Ref.  8]. 
The  assumptions  made  here  are  that  the  data  set  is 
stationary  and  the  error  terms  are  normally  aad  indepen- 
dently distributed  with  a  mean  of  zero  and  constant  vari- 
ance. A  check  on  the  adequacy  of  the  model  is  t o  construct 
an    ACF    for    ths    residuals   of      the    model    and    deternine    if    they 
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are    random    in   nature.      Mathematically,    an    AR     (1)     model    is    of 
the    form: 


Z 


•t        =    *    Zt-1         +    at  (egn    B*1) 


where   <J>    is    equal    to    the      autoregressi ve    coef f iris nt    and    a    is 
the    random    error   or    shock   term. 

H.        THE    MOVING    AVERAGE    MODEL 

A  time  secies  is  said  to  be  governed  by  a  mo/ing  average 
process  if  the  current  value  of  the  time  secies  can  be 
expressed  as  a  linear  function  of  the  current  error  term  and 
previous  error  term(s).  The  same  restrictions  i pply  to  the 
error  terms  of  am  MA  model  as  applied  to  the  AR  model. 
Mathematically,    this   function    can    be    expressed   as; 

Z^        =    a^_t    -    9ar  (eqn    B.2) 

where   6   is    the    moving   average   coefficient    and    a    ls    again    the 
error    term. 

I.        THE    MIXED    ADTOREGRESSIVE    MOVING    AVERAGE    MODEL 

The  mixed  autoregressi ve  moving  average  moiel  contains 
elements  of  both  the  AR  and  the  MA  procedures  and  expresses 
the  relationship  of  a  current  observation  as  a  linear  func- 
tion of  both  past  values  and  past  errors  of  the  /ariable.  As 
with  the  AR  and  MA  models,  the  residuals  of  the  model  are 
evaluated  for  adequacy  by  utilizing  the  ACF  function.  The 
equation    for  the    ARMA   model    is: 
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=  <t> x    zt.i 


+   a 


-  ea 


■c-  I 


(eqn    B.3) 


J.        EVALUATING    THE    MODEL 

Once  a  model  has  been  selected,  there  are  several  ways 
to  check  the  results  for  adequacy.  For  purposes  of  this 
discussion,    the    following   checks    will    be    addressed: 

1 .  ACF   of   residuals 

2.  Minimum    sura    of   squares 

3.  t-ratio 

There  are  several  other  checks  for  adequacy  that  are  avail- 
able to  the  user  of  Box-Jenkins  methodology,  for  a  more 
comprehensive  explanation  the  reader  is  directed  to  Vandaele 
[Ref.  '8]. 
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Figure    B.5         TYPICAL    FORM    OF    MA1    MODEL    ACF   AS D    PACF. 

1  •       ACF   of    The    Residuals 

As  mentioned  throughout  this  discussion,  the  ACF  for 
a  model  should  be  random  about  the  mean  zero  with  constant 
variance    and  a    magnitude   less    than    two    standard   errors. 

2 .  Minimum    Su ra    of   Squares 

Determination  of  this  measure  can  only  be  achieved 
by  comparison  with  other  potential  models.  In  some  cases, 
several  models  may  produce  insignificantly  different  sums  of 
squares  which  will  test  the  user's  judgment  and  application 
of    the   other   measures   of   adequacy. 

3 .  t- Ratio 

In  time  series  analysis  this  is  computed  by  dividing 
the    estimate  of    the    parameter    for      the   model    by    the    standard 
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deviation  for  the  series.  The  rales  as  appliel  to  regres- 
sion analysis  still  held  in  that  the  value  shouli  he  greater 
than  +/-2.0  in  order  to  indicate  that  the  coafficient  is 
significantly   different    from   zero. 


K.        PARSIMONY 

In  the  event  that  more  than  one  model  is  capable  of 
satisfying  the  acceptance  criteria  described  ibove,  the 
principle  of  parsimony  will  then  apply.  This  states  that 
when  faced  with  several  sufficient  model  types,  select  the 
the  lowest  order  model  available  that  sat  isfies  the 
criteria. 

L.       TRANSFER   FUNCTION    MODELS 

Also  known  as  multivariate  autoregressive  integrated 
moving  average,  (MARIMA),  or  leading  indicator  models.  This 
involves  selection  of  an  appropriate  univariate  model  for 
what  is  to  be  the  independent  variable  and  appLying  it  to 
the  dependent  variable.  The  application  of  the  model  will 
result  in  two  sets  of  residuals  which  when  cross  correlated 
at  different  time  lags  will  yield  the  cross  correlation 
function,  (CCF).  This       differs    slightly      frara       the      ACF 

discussed  earlier  in  that  we  now  expect  to  find  a  relation- 
ship of  significant  magnitude  at  various  poii ts  in  the 
comparison.  This  positive  correlation  is  an  indication  that 
there  is  a  significant  relationship  between  the  independent 
and    the    dependent   variable    at    certain    lags    or    lei ds    in    time. 
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Figure  B. 6    TYPICAL  FORM  OF  ARMA  1,1  MODEL  ACF  AND  PACF. 
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APPENDIX    C 
SOHMARY    OF    BOX-JENKINS    MODELS    DSED    FOR    ANALYSIS 

A.        MODELS    USED    IN    UNIVARIATE    ANALYSIS 

1  -      Yeoman 

Model    Type   -    ARIMA    (1,0,0) 
Model    Equation 


Z      =    .44 


,zr-l 


+    a. 


(eqn    C.1) 


T-ratio  -    2. 64 
Model    Residuals   -    Random 
2 .       Storekeepers 

Model    Type   -    ARIMA    (0,1,1) 
Model    Equation 


Z      =   -.814a,..,     +   at 


(eqn    C.2) 


T-ratio  -    7.24 


Model  Residuals  -  Random 


73 


3-      Operations    Specialists 

Model    Type    -    ARIMA     (1,0,0) 
Modal    Equation 


Z      =    .499Zt_(     +    ar 


(eqn    C.3) 


T-ratio   -    3.39 
Model    Residuals  -    Random 
4 •       Electronics    Technicians 

Model    Type    -   ARIMA    (1,0,0) 
Model    Equation 


Z      =    .588Zr_,     +    a 


(eqn    C.4) 


T-ratio  -    4. 1 


Model    Residuals   -    Random 
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5-       Eoiier    Technicians 

Model    Type    -    ARIMA     (1,0,0) 
Model    Equation 

Z      =    .  54  1ZX_(     +    ax  (eqn   C.5) 

T-ratio   -    3.63 

Model    Residuals   -    Random 

B.       BOX-JENKINS    MODELS    OF    REGRESSION    RESIDUALS 

1 .       Yeoman 

Model    Type    -    ARIMA    (0,0,1) 
Model    Equation 

Z      =   -.376at__j      ♦    ar  (eqn    C.6) 

T-ratio  -    -2-  19 

Model  Residuals  -  Random 
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2 •       St orekee gers 

Model    Type    -    ARIMA    (1,1,0) 
Model    Equation 

Z      =    .498^,     +ar  (eqn    C.7) 

T-ratio   -    -3.00 
Model    Residuals   -    Random 
3«      Operations    Specialists 

Model    Type    -   ARIMA    (1,1,1) 
Model    Equation 

Z      =   .60UZt_(     +    at   -    .956at.j  (eqn   C.8) 

T-ratio   -    AR1    -    3.28 
MA1    -    9.75 

Model    Residuals   -    Random 
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Electronics    Technicians 

Model    Type   -    ARIMA    (1,0,1) 
Model    Equation 


Z      =    .854Zr   .     +    a      -.529a__,  (eqn    C.9) 


T-ratio   -    AP.1    -    5.21 
MA1    -    2.04 

Model    Residuals   -    Random 

5 •       Boiler    Technicians 

Model    Type    -    ARIMA     (1,0,0) 

Model    Equation 

Z      =    .  590Zt        +aT  (eqn    C.10) 

T-ratio  -    3.70 

Model    Residuals   -    Random 
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